Abstract

Accurately predicting the price of agricultural commodity is very important for evading market risk, increasing agricultural income, and accomplishing government macroeconomic regulation. With the price index predictions of 6 commodities of Food and Agriculture Organization of the United Nations (FAO) as examples, this paper proposed a novel agricultural commodity price forecasting model which combined the fuzzy information granulation, mind evolutionary algorithm (MEA), and support vector machine (SVM). Firstly, the time series data of agricultural commodity price index was transformed into fuzzy information granulation particles made up of Low, , and Up, which represented the trend and magnitude of price movement. Secondly, MEA algorithm was employed to seek the optimal parameters and for SVM to establish the MEA-SVM model. Finally, FOA price index fluctuation range and change trend in the future were predicted by the MEA-SVM model. The empirical analysis showed that the MEA-SVM model was effective and had higher prediction accuracy and faster calculation speed in the forecasting of agricultural commodity price.

1. Introduction

The scientific and effective forecasting method is conducive to correctly guide agricultural production, realize the balance of supply and demand of agricultural commodities, increase peasants’ income, and provide decision-making basis for the government to adjust agricultural economic structure and implement targeted macro-control [13].

Nowadays, the agricultural price forecasting methods can be divided into two categories: qualitative forecasting methods and quantitative forecasting methods. As a supplement to other agricultural commodity price forecasting methods, qualitative forecasting methods generally do not occupy the mainstream position, with low accuracy and high subjectivity. According to the sequence of occurrence time, the quantitative prediction methods can be divided into econometric method, time series analysis method, and intelligent prediction method. The econometric method finds the supporting economic theory according to the research problem, then puts forward the hypothesis, and establishes the econometric model to verify the hypothesis [49]. However, most empirical studies do not prove that the prediction effect of the classic econometric model is better than that of the time series analysis method [1012]. Therefore, the time series analysis method gradually replaced the econometric method in the 1990s. Due to the complexity and difficulty of price prediction of agricultural commodities, price fluctuations always show the characteristics of repeated ups and downs, instability, benign or vicious circle, etc. The advantages of self-adaptation, self-learning, and self-organization possessed by the intelligent prediction method can well adapt to the characteristics of market price fluctuations of agricultural commodities. Therefore, in recent years, the intelligent prediction method has been increasingly applied to the prediction of agricultural price [1317]. Intelligent prediction methods usually include artificial neural network, chaos theory, entropy analysis, extreme learning machines, radial basis function, and support vector regression.

The deficiency of the above prediction methods is to use a single model to predict the time series data of agricultural commodities. Meanwhile, most forecasting methods provide a point estimation rather than a value range. In the real world, peasants and governments pay more attention to the fluctuation range and trend of agricultural prices. Our study firstly transforms time series data of prices index into fuzzy information granulation particles made up of Low, , and Up, which represent the trend and magnitude of price movements, then employs a hybrid method that combines the support vector machine (SVM) with mind evolutionary algorithm (MEA) to predict the fuzzy information granulation particles Low, , and Up, and finally captures the fluctuation range and change trend exhibited in agricultural commodity futures prices, contributing to existing studies and filling the research gap.

Besides, if the SVM model wants to achieve a relatively ideal regression effect, it must set and adjust relevant parameters; the most important parameters are the penalty parameter and the kernel function parameter . Previous studies applied grid search method (GSM) to seek the optimal parameters and of SVM [1821]. Although GSM can obtain the global optimal solution, that is, the optimal regression accuracy, it is usually conducted within the specified range. Once the search range is expanded, the algorithm training time will be very long [2224]. To further improve the optimization performance and overcome the deficiencies of traditional algorithms, the Chinese scholar Sun Cheng-Yi et al. raised the mind evolutionary algorithm (MEA) in 1998 [25]. MEA is an evolutionary algorithm simulating the progress of human mind and has the positive and negative feedback mechanism, wherein the positive feedback mechanism improves toward being more beneficial to the population survival to consolidate and develop the evolution achievement. The negative feedback mechanism prevents the algorithm premature to avoid that the algorithm is caught in local optimal solution. The structural parallelism of MEA guarantees the high search efficiency of the algorithm, overcomes the defects of traditional evolutionary algorithms such as time-consuming computation and premature, and also has extreme robustness on interference [26, 27]. Although the MEA has been applied in some technical fields [2830], there are little credible researches employing MEA to optimize the parameters of the SVM model. This study utilized MEA to optimize the penalty parameter and the kernel function parameter of SVM and established the novel MEA-SVM model, which provides a methodological perspective for the agricultural price forecasting, extending the prior researches.

Taking the price indexes of Food and Agriculture Organization of the United Nations (FAO) as examples, this study took 6 consecutive trading months as one group, adopted fuzzy information granulation theory to map the time series data into 3 fuzzy information particles with low boundary value Low, median value , and high boundary value Up, then applied MEA to search for the optimal parameters and of the SVM model, established the MEA-SVM model, and finally employed the hybrid MEA-SVM model to predict the lowest, average, and highest prices of the FAO price index in the future period, so as to obtain the future fluctuation interval and variation trend of the FAO price index.

The rest of this article is organized as follows. The fuzzy information granulation theory, SVM model, and MEA are introduced, and the MEA-SVM model is constructed in Section 2. The empirical analysis is given, six agricultural commodity price indexes are predicted, and the performance of MEA-SVM model and GSM-SVM model is compared in Section 3. At last, conclusions are summarized and future research is suggested in Section 4.

2. Model Construction

2.1. Fuzzy Information Granulation Theory

The concept of information granulation was first proposed by American scholar Zadeh [31]. Information granules are collections of objects, which are combined together due to functional similarity, proximity, indivisibility, and functionality. Granulation calculation is a new computing paradigm covering granulation theory, technology, method, and tools. It integrates computing theory, entropy space theory, rough set theory, and interval calculation and is one of the hot spots in the field of artificial intelligence and soft computing science.

Information granulation methods mainly include rough set information granulation theory, fuzzy set information granulation theory, and entropy space information granulation theory. Wherein, the information grains represented by fuzzy sets are called fuzzy information grains. When the fuzzy set method granulates the time series data, it mainly includes two steps: window division and fuzzy processing. The original time sequence is divided into several subsequences, which is called window division; the divided subwindows are fuzzified to generate fuzzy sets, which is called fuzzification. The combination of the two treatments is fuzzy information granulation, also known as granulation. Assuming that the time series data is regarded as one window, the task of fuzzy processing is to establish a fuzzy information particle , which can replace the original window data. In other words, the fuzzy concept (fuzzy set with as the theoretical domain) is determined reasonably so as to describe [32, 33].

Therefore, the essence of fuzzy process is the process of determining the membership function of the fuzzy concept . The general membership functions of fuzzy particles include triangle, parabola, Gaussian, and trapezoid. In this study, triangular fuzzy particles are adopted and their membership functions are shown as follows.

In the formula, the original time series data is represented by , and the minimum, average, and maximum values of the original data are, respectively, described by fuzzy particle parameters , , and .

2.2. Support Vector Machine

The support vector machine (SVM) model was first proposed by Vapnik in 1995 [34]. It can be used for both pattern classification and nonlinear regression analysis. SVM model is a supervised learning algorithm based on the structure risk minimum principle and statistical Vapnik-Chervonenkis dimension theory. The main idea of SVM model is to transform the nonlinear classification problem into a multidimensional linear convex quadratic programming problem through kernel function mapping and establishment of classification higher-dimensional decision surface. The complexity of SVM algorithm is independent from the sample dimension. It has the advantages of universality, robustness, validity, simple calculation, perfect theory, etc. The obtained solution is unique and globally optimal, which avoids the problem that the neural network method falls into local extreme value [3537].

In the case of linear separability, the core of SVM algorithm is to find the optimal classification decision plane. It is assumed that the sample data set of linear separable training is .

In the formula, , , .

The construction of the secondary planning problem is expressed as follows.

The classification surface determined by the optimal solution is expressed as follows.

The decision function is constructed as follows.

SVM model is characterized by sparsity and uses a few support vectors to represent decision functions. In order to guarantee this important characteristic, when SVM algorithm is applied to solve nonlinear regression problems, the loss function needs to be introduced. Taking the standard insensitive loss function as an example, the form of the -SVM model is as follows.

In the formula, and are relaxation variables and represent the model error requirements; C is the penalty parameter, which is used to measure the degree of matching between empirical risk and confidence range.

2.3. Mind Evolutionary Algorithm

Mind Evolutionary Algorithm (MEA) is a kind of evolutionary algorithm simulating the progress of human mind, proposed for the defects of genetic algorithm (GA). The MEA uses two types of operation: similar taxis and dissimilation, and it uses population optimizing instead of individual optimizing. The similar taxis operation is a process in which the individual competes to be a winner, which happens within the scope of subpopulation. The dissimilation operation is the process in which a subpopulation battles to be a winner and continuously explores the new point of solution space, which happens within the whole solution space. In the course of the algorithm running, the similar taxis and dissimilation operation is executed repeatedly until the terminating condition is satisfied [25].

Compared with GA, the MEA has the following advantages: The crossover and mutation operations of GA generate not only superior genes, but also inferior destructive genes, and those operations have duality, but the MEA uses the similar taxis and dissimilation operations, which amend the defects of GA. The similar taxis and dissimilation operations of MEA are coordinated mutually but also are independent mutually, and any improvement on any aspect will raise the algorithm’s prediction accuracy. The similar taxis and dissimilation operations have parallelism on structure, which boosts the algorithm’s search efficiency and computation speed. The MEA divides the populations into superior subpopulations and temporary subpopulations, which can memorize evolutionary information more than one generation [38, 39].

The MEA follows some basic concepts of the GA such as “population,” “individual,” and “environment”, but, meanwhile, it also adds some new concepts.

2.3.1. Population and Subpopulation

The MEA is a kind of learning method making optimization through iteration, and all individuals in every generation of the evolutionary process gather into one population. A population is divided into several subpopulations. The subpopulation contains two classes: superior subpopulation and temporary subpopulation. The superior subpopulation records the information of the winners in the global competition, and the temporary subpopulation documents the process of the global competition.

2.3.2. Billboard

The billboard is equivalent to an information platform, which provides chances of information communication between the individuals or the subpopulations. The billboard records three types of useful information: the serial number of individual or subpopulation, the action, and the score. By utilizing the serial number of individual or subpopulation, it is convenient to distinguish different individuals or subpopulations; the description of action varies from different research fields, and since this article is researching the problem of parameter optimization, and the action is used to record the exact position of the individual and subpopulation, the score is the evaluation of environment on the individual action, and in the optimization process by utilizing the MEA, it can rapidly find out the optimized individuals and populations only if the scores of every individual and subpopulation are recorded all the time. The individuals in the subpopulation post up their own information on the local billboard, and the information of each subpopulation is posted up on the global billboard.

2.3.3. Similar Taxis

Within the scope of subpopulation, the process that an individual competes to be a winner is called similar taxis. In the process of a subpopulation’s similar taxis, if a new winner cannot be generated, it means that such subpopulation has matured. When a subpopulation matures, the similar taxis process of such subpopulation comes to an end. The period of a subpopulation from its birth to maturity is called the lifetime.

2.3.4. Dissimilation

In the whole solution space, each subpopulation competes to be a winner and continuously explores new solution space point; this process is known as dissimilation. The dissimilation has two definitions: each subpopulation makes global competition, and if the score of a temporary subpopulation is higher than the score of a specific matured superior subpopulation, such superior subpopulation will be replaced by the winning temporary subpopulation, and the individuals of the original superior subpopulation will be released; if the score of a matured temporary subpopulation is lower than the score of any superior subpopulation, such temporary subpopulation will be abandoned, and the individuals therein will be released; the released individuals will research and form new temporary subpopulation.

2.3.5. MEA-SVM Model

In this study, the prediction accuracy of training set was taken as the fitness function, and MEA was applied to seek the best penalty parameter and kernel function parameter of SVM model. The overall process of the algorithm is shown in Figure 1.

The main forecasting process of agricultural commodity price based on fuzzy information granulation and MEA-SVM model is described below.

Step 1. Extract time series data of agricultural commodity price index.

Step 2. The triangular fuzzy information granulation theory is applied to process the time series data of the original agricultural commodity price index, and three fuzzy information particles representing the lowest, average, and highest prices of the original agricultural commodity price are obtained.

Step 3. MEA was adopted to search for the optimal parameters and of SVM model.

Step 4. The hybrid MEA-SVM model is applied to predict the agricultural commodity price index for 1-6 and 7-12 months in the future.

Step 5. Verify and evaluate the predicted results.

2.4. Model Assessment Criteria

In this study, mean squared error (MSE), squared correlation coefficient (), and computation time (second) are employed to measure the prediction accuracy and computation speed of MEA-SVM model. The calculation formulas of MSE and are as follows:

3. Empirical Analysis

3.1. Data Sources

This study selected the time series data of FAO price index (http://www.fao.org/home/en/) as the empirical research object. The FAO food price index is a measure of the monthly change in international prices of a basket of food commodities. It consists of the average of six commodity group price indexes (Food Price Index, Cereal Price Index, Vegetable Oil Price Index, Meat Price Index, Dairy Price Index, and Sugar Price Index). The MEA-SVM model takes the time series data of price index of six commodities from January 1990 to June 2017 as training data and the time series data from July 2017 to June 2018 as testing data. As an example, the FAO food price index from January 1990 to June 2017 is shown in Figure 2. Based on this, the highest price, lowest price, average price, and the variation trend of the FAO food price index from July 2017 to June 2018 will be predicted.

3.2. Fuzzy Information Granulation

The time series data of 330 months from January 1990 to July 2017 were regarded as the training set and every 6 months was treated as one information granulation window. Finally, the FAO food price index is transformed into 55 fuzzy information particles, each containing 3 parameters of Low, , and Up, as shown in Figure 3. For one single fuzzy information particle, parameters Low, , and Up describe the minimum, mean, and maximum values of original data changes, respectively.

3.3. Regression Prediction of Granulation Data

The hybrid MEA-SVM model was employed to predict the Low, , and Up. Firstly, the granulation data was preprocessed, then MEA was applied to search the best penalty parameter and kernel function parameter for establishing the MEA-SVM model. Finally, the MEA-SVM model was used for training and prediction. For example, the prediction process of parameter Low is as follows.

3.3.1. Data Preprocessing

In this study, the granulation data of Low was normalized and data normalization range was , as shown in Figure 4.

3.3.2. Searching for the Optimal Parameters c and g

In this study, MEA was adopted to search for the best parameters and of SVM model. The population size was set to 200, the numbers of superior population and temporary population size were set to 5, subpopulation size was set to 20, and evolution generations are set to 200. The minimum-value and maximum-value of were 500 and 10. The minimum-value and maximum-value of were 0.01 and 1. The minimum-value and maximum-value of were 0.01 and 100.

The optimal parameter search processes of MEA for granulated data Low were shown in Figures 5 and 6.

In order to verify the superiority of MEA, GSM was also employed to search the optimal parameters and of SVM model on the basis of the same parameter settings. The GSM first looks for the optimal parameters roughly (Figure 7) and then delicately seeks for the optimal parameter after observing the cursory search results (Figure 8).

In Figures 7 and 8, the X-axis represents the log base 2 of , the -axis represents the log base 2 of , the contour line represents the MSE corresponding to and .

Finally, the optimal parameters obtained by the MEA and GSM for granulation data Low of six FAO price indexes are shown in Table 1.

3.3.3. Model Training

The optimal parameters and were obtained and then substituted into the SVM model to establish the MEA-SVM model. The lowest, average, and highest prices of FAO food price index from January 1990 to June 2017 were predicted by using the MEA-SVM model.

The predicted values, actual values, and relative errors obtained by MEA-SVM for granulation data Low of FAO food price index were shown in Figures 9 and 10. Meanwhile, the performance of MEA-SVM in predicting granulation data Low is compared with GSM-SVM, as shown in Table 2.

In Table 2, the lower the MSE and the higher the , the more accurate the model’s prediction. Excluding oils price index, the comparison results of the other five agricultural commodity price indexes show that the prediction accuracy of the MEA is superior to the GSM. At the same time, the calculation speed is improved.

3.3.4. Model Prediction

The 6-month FAO price index of granulation data Low is regarded as one period, and the fluctuation range and change trend of the 12-month FAO food price index in the next two periods are predicted. By repeating the above data preprocessing, searching for the optimal parameters and , model training and model prediction for the granulation data of and Up, the predicted values of 3 fuzzy particles Low, , and Up in the next two periods were finally obtained (Table 3).

3.4. Predictive Performance Evaluation

The actual price indexes of six commodities from July 2017 to June 2018 were compared with the predicted price indexes to test the prediction effect of fuzzy information granulation and MEA-SVM model (Tables 4 and 5). For instance, from July 2017 to December 2017, the highest price, average price, and lowest price of FAO food price index are 144.4461, 150.3121, and 152.9865, respectively, and the corresponding predicted values are 147.3770, 145.7625, and 143.4025, with the relative prediction errors of 2.03%, -3.03%, and -6.26%.

In terms of mean value of the price index, the predictions of cereals price index and meat price index are the most accurate; the dairy price index has the biggest forecasting error. For many FOA price indexes, the predicted values of granulation data Low, , or Up are lower than the actual observed values, and there are risks of price underestimation. Overall, the prediction error of the second period is higher than that of the first period, so the further the prediction period is, the larger the prediction error is. The minimum and maximum values of the mean prediction relative errors are -1.40% and -20.95% in the first period and -2.03% and -27.49% in the second period. The prediction of the future fluctuation range of FAO price index time series data by fuzzy information granulation and MEA-SVM model is generally accurate.

As for the forecast of price indexes change trend, the price of the second period is lower than that of the first period, which is consistent with the observation of FAO price indexes. The prediction of future change trend of FAO price index time series data by fuzzy information granulation and MEA-SVM model is correct.

4. Conclusion and Discussion

Many uncertainties and random factors exist in agricultural commodity market. Therefore, it is difficult to forecast the agricultural commodity price. In this study, the time series data of agricultural commodity price indexes were granulated by triangular fuzzy information granulation theory. Then the MEA was applied to seek the optimal parameters and to establish the hybrid MEA-SVM model. Finally, the trained MEA-SVM model was used to predict the fluctuation range and change trend of agricultural commodity price index in the future. The empirical results are as follows.

The fuzzy information granulation method can granulate the time series data of the original agricultural commodity price index and use the fuzzy information particles Low, , and Up to represent the lowest, average, and highest values of agricultural commodity price fluctuation.

The MEA can seek the global optimal parameters of SVM model in a wider range without having to traverse all the parameter points, which improves the calculation speed and prediction accuracy.

The MEA-SVM regression model based on fuzzy information granulation is more accurate and effective in predicting the fluctuation range and variation trend of agricultural commodity price index, which indicates that the model has certain advantages in nonlinear simulation, adaptive learning,and processing of incomplete complex information.

The MEA-SVM model can be generalized to predict the price fluctuations of other commodity and help to provide some reference for the time series data prediction in other fields. Nonetheless, the MEA-SVM model based on fuzzy information granulation still has some deficiencies, such as underestimation and the long-term prediction error being larger. In future studies, the penalty parameter c and kernel parameter g of SVM model can be further optimized through other intelligence algorithms to improve the prediction accuracy and stability.

Data Availability

Our research data comes from the Food and Agriculture Organization of the United Nations (FAO). The research data is authoritative and reliable. The FAO food price index is a measure of the monthly change in international prices of a basket of food commodities. It consists of the average of six commodity group price indexes (Food Price Index, Cereal Price Index, Vegetable Oil Price Index, Meat Price Index, Dairy Price Index, and Sugar Price Index). At the same time, the research data is public and can be accessed and obtained through the url http://www.fao.org/home/en/. Research data can be accessed and downloaded by anyone at any time.

Conflicts of Interest

The authors declare no conflicts of interest.

Acknowledgments

This paper performance was supported by Wonkwang University in 2018.